

# University of California College of Engineering Department of Electrical Engineering and Computer Science

Jan M. Rabaey

TuTh 9:30-11am

## EECS 141: SPRING 03—FINAL

For all problems, you can assume the following transistor parameters (unless otherwise mentioned):

NMOS: 
$$V_{Tn} = 0.4V$$
,  $k_n' = 115 \mu A/V^2$ ,  $V_{DSAT} = 0.6V$ ,  $\lambda = 0$ ,  $\gamma = 0.4V^{1/2}$ ,  $2\Phi_F = -0.6V$  PMOS:  $V_{Tp} = -0.4V$ ,  $k_p' = -30 \mu A/V^2$ ,  $V_{DSAT} = -1V$ ,  $\lambda = 0$ ,  $\gamma = -0.4V^{1/2}$ ,  $2\Phi_F = 0.6V$ 

| NAME | Last | First |
|------|------|-------|
|      |      |       |
| SID  |      |       |

**Problem 1 (10):** 

**Problem 2 (15):** 

**Problem 3 (14):** 

**Problem 4 (18):** 

**Problem 5 (18):** 

Total (75)

### **Problem 1: Timing and Clocking (10 pts)**

In order to boost profits, Intel has decided that their next-generation microprocessor has to have ultimate performance. To achieve the desired performance, 16 processors are integrated on the same die (the chip is hence called *seidecium* – for obvious reasons). The designer of the clocking architecture has come up with the strategy shown in the Figure below. A single clock signal is distributed over the complete chip. Three levels of buffering are used as shown by the black boxes in the Figure.



FIG. 1 Seidecium processor clock distribution network. The numbers annotated on the figure indicate the lengths of the wiring segments (in cm).

a) Determine the maximum skew between the different processor modules. (4 pts)

max skew:

| b) | The goal of the designers is to reach a 4 GHz clock speed. Determine the maxi-     |
|----|------------------------------------------------------------------------------------|
|    | mum delay of the logical function blocks given that 20% of the clock period is due |
|    | to the delay of registers. Also, note that the maximum internal skew within a pro- |
|    | cessor module equals 20 ps. (3 pts)                                                |

t<sub>logic</sub>(max)=

c) The Intel designers forgot to account for one thing though. Due to the parameters variations over the die, it is observed that the delay of the clock buffers can vary over 25% (in both positive and negative directions). Determine the worst-case clock speed due to these variations. (3 pts)

f<sub>clock</sub>(min)=

## Problem 2: Interconnect (15 pts)

a) A driver-receiver pair in CMOS technology is shown in Figure 2.



FIG. 2 Driver and receiver. Numbers on transistors indicate (W/L) ratios.

Assume that all transistors are short-channel devices.  $V_M$  of the driver inverter equals  $V_{DD}/2$ . Draw the voltage transfer characteristic  $V_{OUT}$  versus  $V_{IN}$  when the driver is **directly** attached to the receiver. Write down circuit analysis equations and calculate the break points on the VTC. (4 pts)

b) The driver and receiver in a) can be used to drive intermediate circuits. Briefly comment on the advantage and disadvantages of this driving scheme from the perspectives of performance and power. (3 pts)

c) Derive a global expression of the **typical gate** (being an inverter) delay in the presence of wiring with a length equal to  $L_{net}$  followed by a fanout of 4 equivalent gates. Make sure to include all components of delay. You may assume that the following parameters are given:  $C_{gate}$  and  $R_{on}$  of driver (per unit width),  $r_{int}$  and  $c_{int}$  of interconnect (per unit length). You may assume that the diffusion capacitance at the output of the gate is approximately equal to its gate capacitance. Clearly state all other assumptions you are making (e.g. wire model). This question is not related to parts a) and b). (4 pts)

d) Discuss how you would reduce the delay if the capacitive load of the fanout is the dominant factor and discuss the minimum value of the delay. (2 pts) e) Discuss how you would reduce the delay if the interconnect delay is the dominant factor. Derive an expression for the minimum delay. (2 pts)

### Problem 3: Memory (14 pts)

The Figure below shows a novel 2T-DRAM cell to be used in a low-voltage application. The supply voltage is fixed at 1 V. WBL is the write bit-line, RBL is the read bit-line, WL the word-line. Assume initially that node P is fixed at GND.



FIG. 3 A 2-T DRAM cell.

a) Determine the signal levels (V<sub>DD</sub> or GND) that have to be applied to the control signals (WS, RS) to perform a write operation into and a read operation from the cell? (4 pts)

WS (Wr):

RS (Wr):

WS (Rd):

RS (Rd):

b) Explain why this scheme has some major problems. (2 pts)

c) Instead of node P being fixed at GND, we apply a waveform as shown in figure below. Fill in the timing diagrams for the write operation. Denote the voltage levels in terms of  $V_{DD}$  and  $V_{T}$ . Assume there is enough time to let the transient effects settle out (no need to draw them). Explain why this approach is substantially better. (6 pts)



This aproach is better because:

d) Does the memory cell require refresh? Why or why not? (2 pts)

## **Problem 4: Multivibrator Circuits (18 pts)**

a) Shown in the Figure below is a design of a **Schmitt trigger**. Determine the (W/L) ratio of transistor  $M_1$  so that  $V_{M+} = 3V_{Tn}$ .  $V_{DD} = 2.5V$ . You may ignore the body effect in this question. You may also assume a long channel device. Clearly state your other assumptions. **(6 pts)** 



FIG. 4 Schmitt trigger. Numbers on transistors indicate  $(W\!/L)$  ratios.

 $(W/L)_1=$ 

b) Determine approximately the value of  $V_{M\mbox{-}}$  (4 pts)

c) Figure 5 shows an astable multivibrator. Calculate and draw voltage waveforms (see next page) at the capacitor  $V_C$  and at the output  $V_{out}$ . (6 pts)



Assumptions:

Ideal amplifier with symmetric supply

$$(V_{out}^{max} = V_{cc}, V_{out}^{min} = -V_{cc})$$

$$R_1 = 1k\Omega$$
,  $R_2 = 3k\Omega$ ,  $R_3 = R_4 = 4k\Omega$   
 $C = 1 \text{ nF}$ ,  $V_{cc} = 5V$ ,  $V_D = 0.6V$  (ideal diode)  
 $V_{out} (t = 0^-) = -V_{cc}$ 

FIG. 5 Astable multivibrator.



d) What is the oscillation frequency of the multivibrator in Fig. 5? (2pts)

f<sub>OSC</sub>=

#### Problem 5: Scaling and Overall Knowledge (18 pts)

a) Determine the region of operation (Cut off, Linear, Saturation, Vel. saturation) in the following configurations. You may assume that all transistors are short-channel devices and have identical sizes. Ignore body effect.  $V_{DD} = 2.5V$ .

Explain your reasoning, and show your derivations if needed (5 pts).

$$V_{DD}$$

$$GND \rightarrow M1$$

$$GND$$





b) The first row of the table given below lists the characteristics of a successful microprocessor designed for desktop systems. A low power version for portable use is desired and several changes are therefore made to the design. Use simple hand calculations to fill in estimates for blank cells in the table. Use the space below to explain your answers (if needed). All transistors exhibit short-channel I-V characteristics. (7 pts)

|                       | V <sub>DD</sub> /V <sub>T</sub> (V) | W, L, t <sub>ox</sub> (relative) | C <sub>ext</sub> * (nF) | C <sub>int</sub> (nF) | I <sub>sat</sub> (mA) | Ext. clock<br>(MHz) | Int. clock<br>(MHz) | Power (W) |
|-----------------------|-------------------------------------|----------------------------------|-------------------------|-----------------------|-----------------------|---------------------|---------------------|-----------|
| Original              | 2.5 / 0.4                           | 1                                | 3                       | 1                     | 1                     | 300                 | 900                 | 8         |
| Reduced Voltage       | 1.25 / 0.2                          | 1                                | 3                       | 1                     |                       |                     |                     |           |
| Reduced Dimension     | 1.25 / 0.2                          | 0.6                              | 3                       |                       |                       |                     |                     |           |
| Low cap.<br>Packaging | 1.25 / 0.2                          | 0.6                              | 2                       |                       |                       |                     |                     |           |

<sup>\*</sup> External C is dominated by packaging. Internal C is dominated by transistor gate cap.

- c) For each of the following statements, indicate whether it is true or false (circle one answer). (6 pts; 0.5 for correct answer; -0.25 for wrong one)
- **T F** (a) The speed of a ring oscillator can continuously be improved by increasing the W/L ratio of the inverters.
- T F (b) Decreasing supply voltage helps to alleviate the velocity-saturation problems.
- T F (c) The load capacitance of a static CMOS gate has no effect on its VTC.
- **T F** (d) A Φn-block dynamic gate will not have any charge sharing problems if only 0->1 transitions occur at its inputs during evaluation
- **T F** (e) The transistors in a Manchester carry chain should be sized progressively larger from the input to output to reduce the propagation delay.
- **T F** (f) Low-swing buses save power and reduce propagation delay at the same time.
- T F (g) The delay of a static inverter is minimized if  $(W/L)_p/(W/L)_n$  is equal to  $\mu_n/\mu_p$ .
- **T F** (h) Silicided poly lines reduce the delay of a wire by decreasing the capacitance.
- T F (i) The minimum propagation delay between two latches determines if a race condition will occur due to clock skew.
- **T F** (j) A 3-transistor DRAM cell requires a sense amplifier for the cell to be functional.
- T F (k) A NAND-based ROM structure is typically more compact and faster than a NOR-based one.
- ${f T}$   ${f F}$  (l) Given a fixed characteristic impedance  $Z_0$ , the delay of a transmission line is reduced when using Copper instead of Aluminum as interconnect material